NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Secure Normal Form: Mediation Among Cross Cryptographic Leakages in Encrypted Databases

Zhang, Shufan; He, Xi; Kundu, Ashish; Mehrotra, Sharad; Sharma, Shantanu (May 2025, IEEE)

Full Text Available
Meaningful Data Erasure in the Presence of Dependencies

https://doi.org/10.14778/3748191.3748206

Chakraborty, Vishal; Kaminsky, Youri; Mehrotra, Sharad; Naumann, Felix; Nawab, Faisal; Pappachan, Primal; Sadoghi, Mohammad; Venkatasubramanian, Nalini (June 2025, Proceedings of the VLDB Endowment)

Data regulations like GDPR require systems to support data erasure but leave the definition of erasure open to interpretation. This ambiguity makes compliance challenging, especially in databases where data dependencies can lead to erased data being inferred from remaining data. We formally define a precise notion of data erasure that ensures any inference about deleted data, through dependencies, remains bounded to what could have been inferred before its insertion. We design erasure mechanisms that enforce this guarantee at minimal cost. Additionally, we explore strategies to balance cost and throughput, batch multiple erasures, and proactively compute data retention times when possible. We demonstrate the practicality and scalability of our algorithms using both real and synthetic datasets.
more » « less
Full Text Available
Modeling Inhabited Smart Spaces to Support Interoperable IoT-Based Applications

https://doi.org/10.1109/MDM65600.2025.00025

Yus, Roberto; Lahjouji, Nada; Bouloukakis, Georgios; Mehrotra, Sharad; Venkatasubramanian, Nalini (June 2025, IEEE 2025 26th IEEE International Conference on Mobile Data Management (MDM))

IoT deployments in smart spaces can enable the development of useful services for their inhabitants. However, the diversity of smart spaces and their sensor infrastructures makes it challenging to develop space-agnostic applications. Moreover, existing schemas addressing interoperability challenges often lack the vocabulary needed to represent the integration of smart space systems and their inhabitants. We present a schema to annotate inhabited smart spaces in support of inhabitant-oriented applica- tions. Our schema integrates well-known ontologies to represent inhabitants, events/activities, and the space itself, along with their interconnections. It also supports the representation of uncertain information from IoT and mobile sensors (e.g., a person’s location or occupancy/attendance at an event). Additionally, we introduce an annotation tool that uses an easy-to-use GUI to describe a smart space based on our schema. We demonstrate the potential of our approach through a series of SPARQL queries and a system deployed at the UCI campus that annotates sensor data to support a space-agnostic occupancy monitoring application.
more » « less
Full Text Available
ProBE: Proportioning Privacy Budget for Complex Exploratory Decision Support

Lajouji, Nada; Ghayyur, Sameera; He, Xi; Mehrotra, Sharad (July 2024, ACM)

Full Text Available
PLAQUE: Automated Predicate Learning at Query Time

https://doi.org/10.1145/3639301

Lin, Yiming; Mehrotra, Sharad (March 2024, Proceedings of the ACM on Management of Data)

Predicate pushing down is a key optimization used to speed up query processing. Much of the existing practice is restricted to pushing predicates explicitly listed in the query. In this paper, we consider the challenge of learning predicates during query execution which are then exploited to accelerate execution. Prior related approaches with a similar goal are restricted (e.g., learn only from only join columns or from specific data statistics). We significantly expand the realm of predicates that can be learned from different query operators (aggregations, joins, grouping, etc.) and develop a system, entitled PLAQUE, that learns such predicates during query execution. Comprehensive evaluations on both synthetic and real datasets demonstrate that the learned predicate approach adopted by PLAQUE can significantly accelerate query execution by up to 33x, and this improvement increases to up to 100x when User-Defined Functions (UDFs) are utilized in queries.
more » « less
Full Text Available
BatchIT: Intelligent and Efficient Batching for IoT Workloads at the Edge

https://doi.org/10.1109/NOMS59830.2024.10575298

Wang, Guoxi; Hildebrant, Ryan; Chio, Andrew; Mehrotra, Sharad; Venkatasubramanian, Nalini (May 2024, IEEE)

Next-generation stream processing systems for community scale IoT applications must handle complex nonfunctional needs, e.g. scalability of input, reliability/timeliness of communication and privacy/security of captured data. In many IoT settings, efficiently batching complex workflows remains challenging in resource-constrained environments. High data rates, combined with real-time processing needs for applications, have pointed to the need for efficient edge stream processing techniques. In this work, we focus on designing scalable edge stream processing workflows in real-world IoT deployments where performance and privacy are key concerns. Initial efforts have revealed that privacy policy execution/enforcement at the edge for intensive workloads is prohibitively expensive. Thus, we leverage intelligent batching techniques to enhance the performance and throughput of streaming in IoT smart spaces. We introduce BatchIT, a processing middleware based on a smart batching strategy that optimizes the trade-off between batching delay and the end-to-end delay requirements of IoT applications. Through experiments with a deployed system we demonstrate that BatchIT outperforms several approaches, including micro-batching and EdgeWise, while reducing computation overhead.
more » « less
Full Text Available
SES: Bridging the Gap Between Explainability and Prediction of Graph Neural Networks

https://doi.org/10.1109/ICDE60146.2024.00229

Huang, Zhenhua; Li, Kunhao; Wang, Shaojie; Jia, Zhaohong; Zhu, Wentao; Mehrotra, Sharad (May 2024, IEEE)

Full Text Available
Water-COLOR: Water-COnservation using a Learning-based Optimized Recommender

https://doi.org/10.1109/SMARTCOMP61445.2024.00034

Zhang, GuangXue; Feldman, David L; Lin, Yiming; Mehrotra, Sharad; Venkatasubramanian, Nalini; Drew, Thayer; Sentovich, Kim; Veranth, Owen (June 2024, IEEE)

Efficient water use, particularly in the realm of irrigation, has emerged as a critical concern in regions suffering from persistent drought, such as California and Florida. With the advent of smart irrigation controllers encouraged by environmental policies, a new paradigm of water management is gaining traction. Among these, the Rachio smart controller has garnered significant attention. However, without direct feedback or actual water usage data, optimizing these irrigation systems for enhanced efficiency remains challenging. This paper introduces Water-COLOR, a novel recommendation system integrated within the Rachio smart controller's framework to address this challenge. The system leverages similar landscape profiles to suggest irrigation schedules that are both water-efficient and user-preferable. By analyzing manual user interactions with the controller, Water-COLOR infers user satisfaction, which, along with estimated water usage, informs the adaptation of irrigation plans. The system eschews the need for additional sensors, thereby reducing infrastructure requirements. Our evaluation demonstrates consistent performance across diverse climatic regions and indicates that the system's recommendations could significantly contribute to water conservation efforts. The results not only showcase the potential of Water-COLOR to enhance the efficiency of existing smart irrigation systems but also open avenues for deploying real-time, data-driven environmental solutions.
more » « less
Full Text Available
ZIP: Lazy Imputation during Query Processing

https://doi.org/10.14778/3617838.3617841

Lin, Yiming; Mehrotra, Sharad (September 2023, Proceedings of the VLDB Endowment)

This paper develops a query-time missing value imputation framework, entitled ZIP, that modifies relational operators to be imputation aware in order to minimize the joint cost of imputing and query processing. The modified operators use a cost-based decision function to determine whether to invoke imputation or to defer to downstream operators to resolve missing values. The modified query processing logic ensures results with deferred imputations are identical to those produced if all missing values were imputed first. ZIP includes a novel outer-join based approach to preserve missing values during execution, and a bloom filter based index to optimize the space and running overhead. Extensive experiments on both real and synthetic data sets demonstrate 10 to 25 times improvement when augmenting the state-of-the-art technology, ImputeDB, with ZIP-based deferred imputation. ZIP also outperforms the offline approach by up to 19607 times in a real data set.
more » « less
Full Text Available
Veil: A Storage and Communication Efficient Volume-Hiding Algorithm

https://doi.org/10.1145/3626759

Han, Shanshan; Chakraborty, Vishal; Goodrich, Michael T; Mehrotra, Sharad; Sharma, Shantanu (December 2023, Proceedings of the ACM on Management of Data)

This paper addresses volume leakage (i.e., leakage of the number of records in the answer set) when processing keyword queries in encrypted key-value (KV) datasets. Volume leakage, coupled with prior knowledge about data distribution and/or previously executed queries, can reveal both ciphertexts and current user queries. We develop a solution to prevent volume leakage, entitled Veil, that partitions the dataset by randomly mapping keys to a set of equi-sized buckets. Veil provides a tunable mechanism for data owners to explore a trade-off between storage and communication overheads. To make buckets indistinguishable to the adversary, Veil uses a novel padding strategy that allow buckets to overlap, reducing the need to add fake records. Both theoretical and experimental results show Veil to significantly outperform existing state-of-the-art.
more » « less
Full Text Available

« Prev Next »

Search for: All records